36 research outputs found

    Facilitating functional annotation of chicken microarray data

    BACKGROUND: Modeling results from chicken microarray studies is challenging for researchers because little functional annotation is associated with these arrays. The Affymetrix GeneChip chicken genome array, one of the largest arrays and a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO). However, the GO annotation data presented by Affymetrix are incomplete; for example, they do not show references linked to manually annotated functions. In addition, no tool allows microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. As a result, researchers spend considerable time searching multiple GO databases for functional information. RESULTS: We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on the Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets, we developed the Array GO Mapper (AGOM) tool to help researchers quickly retrieve the corresponding functional information for their datasets. CONCLUSION: Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray datasets into more reliable biological functional information by using the AGOM tool. The diseases, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via the AgBase website and will be updated on a regular basis.

    GIFtS: annotation landscape analysis with GeneCards

    BACKGROUND: Gene annotation is a pivotal component in computational genomics, encompassing prediction of gene function, expression analysis, and sequence scrutiny. Hence, quantitative measures of the annotation landscape constitute a pertinent bioinformatics tool. GeneCards® is a gene-centric compendium of rich annotative information for over 50,000 human gene entries, building upon 68 data sources, including Gene Ontology (GO), pathways, interactions, phenotypes, publications and many more. RESULTS: We present the GeneCards Inferred Functionality Score (GIFtS), which allows a quantitative assessment of a gene's annotation status by exploiting the unique wealth and diversity of GeneCards information. The GIFtS tool, linked from the GeneCards home page, facilitates browsing the human genome by searching for the annotation level of a specified gene, retrieving a list of genes within a specified range of GIFtS values, obtaining random genes with a specific GIFtS value, and experimenting with the GIFtS weighting algorithm for a variety of annotation categories. The bimodal shape of the GIFtS distribution suggests a division of the human gene repertoire into two main groups: the high-GIFtS peak consists almost entirely of protein-coding genes; the low-GIFtS peak consists of genes from all of the categories. Cluster analysis of GIFtS annotation vectors provides the classification of gene groups by detailed positioning in the annotation arena. GIFtS also provides measures that enable the evaluation of the databases that serve as GeneCards sources. An inverse correlation is found (for GIFtS>25) between the number of genes annotated by each source and the average GIFtS value of genes associated with that source. Three typical source prototypes are revealed by their GIFtS distribution: genome-wide sources, sources comprising mainly highly annotated genes, and sources comprising mainly poorly annotated genes. The degree of accumulated knowledge for a given gene measured by GIFtS was correlated (for GIFtS>30) with the number of publications for a gene, and with the seniority of this entry in the HGNC database. CONCLUSION: GIFtS can be a valuable tool for computational procedures that analyze large sets of genes resulting from wet-lab or computational research. GIFtS may also assist the scientific community with identification of groups of uncharacterized genes for diverse applications, such as delineation of novel functions and charting unexplored areas of the human genome.

    Re-Annotation Is an Essential Step in Systems Biology Modeling of Functional Genomics Data

    One motivation of systems biology research is to understand gene functions and interactions from functional genomics data such as that derived from microarrays. Up-to-date structural and functional annotations of genes are an essential foundation of systems biology modeling. We propose that the first essential step in any systems biology modeling of functional genomics data, especially for species with recently sequenced genomes, is gene structural and functional re-annotation. To demonstrate the impact of such re-annotation, we structurally and functionally re-annotated a microarray developed, and previously used, as a tool for disease research. We quantified the impact of this re-annotation on the array based on the total numbers of structural and functional annotations, the Gene Annotation Quality (GAQ) score, and canonical pathway coverage. We next quantified the impact of re-annotation on systems biology modeling using a previously published experiment that used this microarray. We show that re-annotation improves the quantity and quality of structural and functional annotations, allows more comprehensive Gene Ontology based modeling, and improves pathway coverage for both the whole array and a differentially expressed mRNA subset. Our results also demonstrate that re-annotation can result in a different knowledge outcome from that derived from previously published research findings. We propose that, because of this, re-annotation should be considered an essential first step for deriving value from functional genomics data.

    Quality of Computationally Inferred Gene Ontology Annotations

    Gene Ontology (GO) has established itself as the undisputed standard for protein function annotation. Most annotations are inferred electronically, i.e. without individual curator supervision, but they are widely considered unreliable. At the same time, we crucially depend on those automated annotations, as most newly sequenced genomes are non-model organisms. Here, we introduce a methodology to systematically and quantitatively evaluate electronic annotations. By exploiting changes in successive releases of the UniProt Gene Ontology Annotation database, we assessed the quality of electronic annotations in terms of specificity, reliability, and coverage. Overall, we found not only that electronic annotations have significantly improved in recent years, but also that their reliability now rivals that of annotations inferred by curators when they use evidence other than experiments from primary literature. This work provides the means to identify the subset of electronic annotations that can be relied upon, an important outcome given that >98% of all annotations are inferred without direct curation.

    Comparative Developmental Expression Profiling of Two C. elegans Isolates

    Gene expression is known to change during development and to vary among genetically diverse strains. Previous studies of temporal patterns of gene expression during C. elegans development were incomplete, and little is known about how these patterns change as a function of genetic background. We used microarrays that comprehensively cover known and predicted worm genes to compare the landscape of genetic variation over developmental time between two isolates of C. elegans. We show that most genes vary in expression during development from egg to young adult, many genes vary in expression between the two isolates, and a subset of these genes exhibits isolate-specific changes during some developmental stages. This subset is strongly enriched for genes with roles in innate immunity. We identify several novel motifs that appear to play a role in regulating gene expression during development, and we propose functional annotations for many previously unannotated genes. These results improve our understanding of gene expression and function during worm development and lay the foundation for linkage studies of the genetic basis of developmental variation in gene expression in this important model organism.

    Proteomics-Based Systems Biology Modeling of Bovine Germinal Vesicle Stage Oocyte and Cumulus Cell Interaction

    BACKGROUND: Oocytes are the female gametes which establish the program of life after fertilization. Interactions between the oocyte and the surrounding cumulus cells at the germinal vesicle (GV) stage are considered essential for proper maturation or 'programming' of oocytes, which is crucial for normal fertilization and embryonic development. However, despite its importance, little is known about the molecular events and pathways involved in this bidirectional communication. METHODOLOGY/PRINCIPAL FINDINGS: We used differential detergent fractionation multidimensional protein identification technology (DDF-MudPIT) on bovine GV oocytes and cumulus cells and identified 811 and 1247 proteins in GV oocytes and cumulus cells, respectively; 371 proteins were significantly differentially expressed between the two cell types. Systems biology modeling, which included Gene Ontology (GO) and canonical genetic pathway analysis, showed that cumulus cells have higher expression than GV oocytes of proteins involved in cell communication, generation of precursor metabolites and energy, and transport. Our data also suggest the hypothesis that oocytes may depend on the presence of cumulus cells to generate specific cellular signals to coordinate their growth and maturation. CONCLUSIONS/SIGNIFICANCE: Systems biology modeling of bovine oocytes and cumulus cells in the context of GO and protein interaction networks identified the signaling pathways associated with the proteins involved in the cell-to-cell signaling biological process, which may have implications for oocyte competence and maturation. This first comprehensive systems biology modeling of bovine oocyte and cumulus cell proteomes not only provides a foundation for signaling and cell physiology at the GV stage of oocyte development, but is also valuable for comparative studies of other stages of oocyte development at the molecular level.